The 2011 NIST Language Recognition Evaluation

Authors

  • Craig S. Greenberg
  • Alvin F. Martin
  • Mark A. Przybocki
Abstract

In 2011, NIST held the most recent in an ongoing series of Language Recognition Evaluations originating in 1996. The 2011 NIST Language Recognition Evaluation (LRE11) featured 24 languages, including nine languages new to the LRE series, from two different source types, and had participation from 23 research organizations. LRE11 utilized a new evaluation metric, which focused on difficult-to-distinguish language pairs. The most difficult pairs were generally contained within clusters of linguistically similar languages. For example, the Hindi/Urdu pair and the Lao/Thai pair both proved to be very challenging to distinguish. Pashto and Bengali were found to be confusable with a wide range of languages, and some progress was observed in distinguishing American English from Indian English.

Similar resources

NIST language recognition evaluation - plans for 2015

We discuss two NIST coordinated evaluations of automatic language recognition technology planned for calendar year 2015 along with possible additional plans for the future. The first is the Language Recognition i-Vector Machine Learning Challenge, largely modeled on the 2013-2014 Speaker Recognition i-Vector Machine Learning Challenge. This online challenge, emphasizing the language identificat...


IIR System Description for the 2011 NIST Language Recognition Evaluation

The Institute for Infocomm Research (IIR) team submitted two systems, namely the primary iir_primary_llr and the contrastive iir_contrast1_llr, to the 2011 NIST Language Recognition Evaluation (LRE). Both systems are based on the fusion of multiple classifiers. These classifiers are broadly divided into two groups: acoustic and phonotactic. Included in the submission are the result files: iir1/...


University of the Basque Country (EHU) Systems for the 2011 NIST Language Recognition Evaluation

This paper describes the systems developed by the Software Technologies Working Group (http://gtts.ehu.es) of the University of the Basque Country for the 2011 NIST Language Recognition Evaluation. Four different systems (one primary and three contrastive) were submitted, consisting of a fusion of five subsystems: a Linearized Eigenchannel GMM (LE-GMM) subsystem, an iVector subsystem and three ...


Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA)

In this brief communication we present the project GLOSA, financed by the Government of the Basque Country for the period 2010-2011. The project has two main technological objectives: (1) creating a suitable infrastructure for the development and evaluation of language recognition technologies; and (2) preparing a competitive language recognition system for conversational telephone speech, whic...


NIST Language Recognition Evaluation – Past and Future

This is a review of the six NIST Language Recognition Evaluations from 1996 to 2011. The evolving nature of the task is described, including the (non-)distinction between language and dialect. The languages/dialects tested are noted, and the challenges of data collection for such evaluations and the collections actually undertaken are reviewed. The performance measures employed are defined, and...




Publication date: 2012